Non-Additivity of the Entanglement of Purification 
(Beyond Reasonable Doubt) 
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We demonstrate the convexity of the difference between the regularized entanglement of purifica- 
tion and the entropy, as a function of the state. This is proved by means of a new asymptotic protocol 
to prepare a state from pre-shared entanglement and by local operations only. 

We go on to employ this convexity property in an investigation of the additivity of the (single-copy) 
entanglement of purification: using numerical results for two-qubit Werner states we find strong ev- 
idence that the entanglement of purification is different from its regularization, hence that entangle- 
ment of purification is not additive. 



I. INTRODUCTION 

It is well understood that entanglement plays a key 
role in quantum information science. The best known 
applications of quantum entanglement, like superdense 
coding [1] and quantum teleportation f^, demonstrate 
this amply. The theory of quantum entanglement, which 
aims at quantifying entanglement, has been developed 
greatly during the past several decades. For a bipar- 
tite pure state ip AB = \ijj){ijj\ AB , the von Neumann en- 
tropy of the reduced state, S(A) = — Tr ip A log tp A pro- 
vides the unique measure of entanglement where ip = 
Tip \ip)(ip\ AB . It is denoted E(ip), and this number quan- 
tifies the asymptotically faithful conversion rate of many 
copies of ip into maximally entangled qubit pairs, and 
vice versa j^j. For mixed state, this asymptotic reversibil- 
ity is lost, in general the so-called distillable entangle- 
ment is strictly smaller than the entanglement cost; see 
the recent survey (3] for these facts and pointers to the 
vast literature on entanglement quantification. 

Motivated by entanglement theory, Terhal et at |5|] 
proposed a measure of total (i.e. encompassing both 
quantum and classical) correlations in a quantum state, 
called entanglement of purification. 

Definition 1 Given a bipartite density matrix p AB on A®B, 
the entanglement of purification (EoP) is 

E P (p) :=mmE^)(ip\ AA ' :BB ' 

s.t. t\) AA ' BB ' purification of p AB , 

where E [\'^)('4'\ AA ' :BB '^j = S(AA') is the entanglement of 
the pure state tp across the bipartite cut AA' : BB'. 

That the above is really a minimum and not just an infi- 
mum follows from the fact that w.l.o.g. the dimensions 
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of A' and B' are bounded in terms of \A\ and \B\ |||]. 
deed, in 16|] it was shown that one may assume 
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Entanglement of purification is a genuine measure of 
total correlation in a bipartite state: it is non-negative, 
vanishes precisely on the product states p AB ~ p ® p B 
(which are the only states without any correlations), and 
is non-increasing under local operations. Also, it is 
known to be asymptotically continuous [5]. Further- 
more, it has an operational interpretation as a cost mea- 
sure. Namely, it was shown in [5] that the entanglement 
cost of preparing many copies of a bipartite state p AB , 
with the restriction that only a vanishing rate of commu- 
nication is allowed, denoted -Elo, (p)> equals the regular- 
ized entanglement of purification: 

E ZOq (p)= lim -E P (p^)=:E^(p). 

(That a communication Q(s/n) is sufficient and nec- 
essary, even for pure states, was shown by Lo and 
Popescu d and in flU.) 

Hayashi proved that the optimal visible compression 
rate for mixed states is equal to Ep° of a state associated 
to the ensemble [ 10]. More generally, the regularized en- 
tanglement of purification characterizes the communi- 
cation cost of simulating a channel without prior entan- 
glement fllll (in contrast to the Quantum Reverse Shan- 
non Theorem). Furthermore, in [12, Theorem 2] entan- 
glement of purification, or rather its regularization, was 
linked to the maximum advantage a given mixed state 
yields in dense coding. 

However, it is not known how to evaluate the regular- 
ized entanglement of purification. As a matter of fact, it 
is still an open question whether entanglement of purifi- 
cation is additive, i.e. 



E P {p MB '®a A ^) = Ep{p^ 
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Clearly, a positive answer to this question would imply 
Ep, and thus a single-letter formula for E^o q (p)- 
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Recently, several similar-looking entanglement quanti- 
ties and capacity-like measures were shown to be non- 
additive Ill3l - ll7ll . and so one might speculate that the an- 
swer to the above question is negative, too. However, 
these constructions do not seem to imply anything di- 
rectly for entanglement of purification. 

Remark 2 E P (ip AB ) = S(ip) for pure states ip = \tp)(ip\, 
and on product states, Ep(p A ® p B ) = 0, so additivity holds 
for these two classes H/. 

In ilgl/ it was shown more generally that Ep{p AB ) = 
S(p A ) whenever the (pure or mixed) state p is supported ei- 
ther on the antisymmetric or the symmetric subspace ofA®B, 
with \A\ = \B\. So additivity holds for all such states, too. 

In the present paper, we prove results which strongly 
suggest that entanglement of purification may not be ad- 
ditive. In Section HU we will introduce a new property 
of the regularized entanglement of purification, which 
can be expressed as the convexity of the difference be- 
tween regularized entanglement of purification and the 
entropy of the state, Ep°{p) — S(p). Then, in section [TTT1 
we investigate numerically the functional Ep(p) — S(p) 
for the one-parameter family of Werner states on two 
qubits: since we find that the latter is not convex, we con- 
clude (except for gross numerical error) that entangle- 
ment of purification is different from its regularization. 
Indeed, our convexity result implies an upper bound on 
Ep° which is much smaller than our best estimate for 
Ep on certain Werner states. Finally, in section [IV] we 
conclude, highlighting some open questions. 

II. A CONVEXITY PROPERTY OF REGULARIZED 
ENTANGLEMENT OF PURIFICATION 

Here we state our main result, a new property of the 
regularized entanglement of purification: 

Theorem 3 For a decomposition p AB = ^ . Pip AB as an en- 
semble of possibly mixed states p ir 

e?{p ab ) < E^p(pf) + x(te;ft}), 

i 

where x = x({PH Pi}) = S{Y,iPiPi) ~ T,iP* s (pd is the 
Holevo information (cf. u9il ). 

Proof We shall describe an asymptotic protocol for cre- 
ating p® n , using asymptotically optimal ways of gen- 
erating pf ki (with ki « npi) as subroutines. In the 
protocol, the term ^ PiEp^ '{p AB ) will be naturally vis- 
ible as the rate of entanglement used, while x({Pi> Pi}) 
will emerge as the rate of classical shared randomness 
(which of course can be obtained from entanglement at 
rate 1 by measuring). 
To be specific, we have 

i n — ?'i 22 . . .i n 



wither, = p i± pi 2 ■ --p in and pjn = p il <E>p l2 <E)- ■ -<E>p ln . For 
a string i n = i\i2 ■ . ■ i n let k(i\i n ) count the number of 
occurrences of i. Then, define the set of typical indices, 

T:={i n :Vi \k(i\i n ) - Pl n\ < 5n} . 

Below we outline the argument to show that there exists 
a family of indices, i n (l), . . . , i n (K) e T, K = 2 n ^ +5 \ 
such that 

K 1 

i=i 

the approximation being asymptotically perfect in trace 
norm. Then the protocol to create p® n goes as follows: 
The two parties use n(x + 5) ebits to create the same 
number of shared random bits; these are used to sample 
a uniformly random i n (j), j = 1, . . . , K. Then for each i, 
they invoke the given protocols to generate fcj = kri\i n ) 
copies of pi, using ki(Ep°(p.i)+5) ebits and LO g , thus cre- 
ating an approximation to p^ rj) . The total entanglement 
consumption of this protocol is 

< n( X + S) +nJ2p i Ep*(p l ) + nS + nSlog\A\, 

i 

which is what we want, since S > can be made arbi- 
trarily small. 

The set {i n (l), ■ ■ . , i n (K )} is shown to exist by the 
probabilistic method: Indeed, we draw the i n (j) i.i.d. ac- 
cording to the distribution := on T, with 
Q = P n (T) the probability of finding a random string 
i n in the set T ■ The core part of the proof of the main 
theorem in 1201] (Theorem 2, specifically p. 163) shows 
that this works. The same technique was used again 
in ||2TI Proposition 2], incidentally in a different attempt 
to quantify total correlations in a quantum state. Here 
we give only a summary outline. 

We need to introduce some more "typicality" notation 
(cf. |H for more details and properties of these notions): 
The typical projector II of p® n is 

pj ._ )^£-nS{p)—&' n <- p®n <- <£- nS(p)+S'n\ 

the spectral projector corresponding to the typical eigen- 
values of p® n . Finally, the conditional typical projectors 
of the states p^ — p^ ® pi 2 <g> • • • <g> pi n : 

Uin := ^2- nB - s ' n < p^ < 2-"^+ <5 '™| , 
where S = ^Z i PiS{pi). Consider now the operators 

p' fn :=nn,, ftn n, n n, 

which have the property that for every 6' > one can 
choose 5 > 0, such that for large enough n and all i n e T, 

lift- -P^ || 1 < o(l). Thus, 

p {n)l ■■= E 
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is supported on the typical subspace and defined such 
that \\p® n - < o(l). Define 



as the spectral projector corresponding to the "large" 
eigenvalues of /)'"''. 
Finally let 

p:=n' P 'n' = 

wither^ =Wp' in W. 

Now, observe that, restricted to the support of II', and 
for i n € T, 

p > 2~ nS (' ) )~' 5 '"n', 



nr. < 2 



-nS-\- 5' n 



In this situation we can apply the operator sampling 
lemma in [22] and conclude that with high probability, 
i n (l), . . . , i n {K) £ T are such that for large enough n, 

K < 2"(x+3< 5 ') and 



1 K 



3 = 1 



<o(l), 



hence similarly the same for the distance of the analo- 
gous sum over the Pi^(j), from p® n . Since 8' > was 
arbitrary, this concludes the proof. □ 

For our present purposes, we rearrange the terms in 
the above theorem: 

Corollary 4 For p AB = Y.Vtpf B , 

i 

E^(p AB )-S(p AB )<^(Ep( P , 



AB\ 



S{ P f B )) 



In other words, Ep°(p) — S(p) is a convex function of p. □ 

Thus if we can find some examples to show Ep(p) — 
S(p) is not convex on quantum states, then Ep° can not 
be equal to Ep, which will prove that Ep is not additive 
on some states. In the following section, we will present 
the numerical results for two-qubit Werner states (repli- 
cating essentially the study of OJ), which indicate that 
entanglement of purification is not additive. 



III. TWO-QUBIT WERNER STATES 

Here we are considering the two-qubit Werner states, 
arguably the simplest family of states not covered by the 
additivity results mentioned in Remark[2] 



W(f) :=/*„ + (!- /)-(!-*„), 



with the maximally entangled singlet state *o and < 
/ < 1 (the singlet fraction). 

Already in the original EoP paper @, the authors per- 
formed a numerical minimization with |v4'|,|i?'| < 4, 
which thanks to |6[] we know to be sufficient to find 
Ep(W(f)) - see Fig. [T] One way of looking at the min- 
imization that one has to perform is as follows: Diago- 
nalizing the state, p = Xa=o -M^jX^il/ where j^j) are 
the four Bell states, starting with the singlet |^o)/ an d 
Ao = e, Ai = A2 = A3 = ijp. Then we can write a 
standard purification 



\<P) 



ABA' 



A' 



and any other purification \tp) £ ABA 1 B' of p we can 
obtain as 



W) 



ABA' B' 



(t AB ®V)\y) 



ABA' 



(1) 



with an isometry V ■ A' A'B', described by 64 
complex numbers (subject to normalization and orthog- 
onality constraints, effectively leaving 30 + 29 + 27 + 
25 = 121 independent real parameters). Note that one 
could extend the isometry to a unitary U on A'B', with 
U\(j>) A |0) s = V\4>) - however, this introduces a large 
number of spurious variables, in fact more than dou- 
bling them to 256, which have no impact on the objective 
function. 

The graph shows an apparent - concave! - kink (dis- 
continuity of the first derivative) at / w .005. Note that 
if the kink was real, we had achieved our goal, since the 
entropy S(W(f)) is a smooth function on the open in- 
terval (0, 1), hence the difference A(/) = E P (W{f)) - 
S(W(f)) could not possibly be convex as a function of 

Motivated by this observation, we did a re-calculation 
for < / < .01. This revealed that the first regime, 
where Ep(W(f)) ps 1, is smaller than it was observed 
in OJ; the range we determined is about [0, .004] j23Tl , al- 
though the deviation is tiny. However, we still see the 
change from a regime where the function is almost con- 
stant 1 to one where it decreases sharply with /. In Fig|2] 
we show A(/) and one can see that indeed it is not con- 
vex. 

We should point out that by using standard mini- 
mization algorithms (local descent with various, usu- 
ally random, starting points), we cannot calculate the 
exact value of entanglement of purification: What these 
methods give us are at best local minima. However, 
we can treat the local minima from numerics as upper 
bounds on the entanglement of purifications, since the 
algorithm finds concrete feasible points with certain val- 
ues of the objective function to be minimized: 

Ep(W(0)) = 1 and E P {W{M)) < .9226, 

showing via theorem|3]that J B~(M/(.005)) < .9663. 



4 




FIG. 1. The numerical results of for E P (W(f)). Note 
that the only values known rigorously are at / = and / = 
1 (both 1) and at / = j (0). Four different regimes were 
observed numerically. In the first regime, which only extends 
over a very small range, approximately < / < .005, the 
optimal V of eq. (JTJ seems to be the trivial \(f>) A n> \<f>) A |0) s . 
Thus on this short interval, Ep(W (f)) = 1. In the second 
regime (roughly .005 < / < .25), entanglement of purification 
appears convex and steeply decreasing with /. 



Put differently, if E P (W (.005)) > .9663, we will have 
A(0.005) > ^A(O) + iA(.Ol), 

i.e. non-convexity of Ep(p) — S(p), and thus non- 
additivity of entanglement of purification. 

The numerics suggests Ep(W (.005)) > .99, not even 
coming close to the above value of .9663. Hence, unless 
there is some deep and narrow "crevasse" in the land- 
scape of the function E(ip AA :BB ), hiding the true mini- 
mum value, we are forced to conclude that Ep° (W(.005)) 
is strictly smaller than E P (W (.005)). 



IV. DISCUSSION AND CONCLUSIONS 

Our main new contribution to the study of entangle- 
ment of purification, and its regularization Ep° = E^o , 
is theorem [3] A special case of its application is when 
p AB is decomposed into product states, meaning that 
Ep(pf B ) = 0. Then, the protocol described in the proof 
of the theorem uses only shared randomness, at rate 
yflPii Pi})- This generalizes a result due to Wyner l24ll 
(cf. B25I1 for a more modern account) on the creation of 
a bipartite distribution Pxy by local operations (noisy 



-0.58 



-0.6 



-0.62 



3 -0.64 



s -0.66 



-0.68 



-0.7 



-0.72 




'1 23456789 



X ^o' 3 



FIG. 2. A(/) = E P (W(f)) - S(W(f)) for < / < .01. 

channels) from limited shared randomness: 

w(Pxy) = minI(V : XY) 

s.t. X — V — Y is a Markov chain, 

where I(V : XY) = H(XY) - H(XY\V) is the Shannon 
mutual information. 

Since the theorem puts a nontrivial bound on the reg- 
ularized entanglement of purification, expressed conve- 
niently as the convexity of Ep° — S, we could use it to 
probe the additivity of entanglement of purification. We 
find that, apart from the possibility of a gross numer- 
ical error, entanglement of purification is non-additive 
already on certain two-qubit Werner states. Interest- 
ingly, we can only say that for some sufficiently large 
n, ^E P (p® n ) < Ep(p), but our proof of theorem does 
not yield directly an estimate for this n; in any case, we 
may expect it to be rather large. 

The non-additivity of entanglement of purification 
also answers a question from 1121 : Indeed, our results 
imply the non-additivity of the "quantum advantage of 
dense coding" on some states, via their monogamy iden- 
tity [12, Theorem 2]. 

To come back to our Werner state example: Of course, 
it would be most desirable to remove the need for nu- 
merical calculation in the argument. We leave a com- 
pletely rigorous proof of the non-additivity of entangle- 
ment of purification to future work; noting only that 
since our example is concrete, and we have a concrete 
benchmark, 

E P (W(.005)) ^ .9663, 
this could be accomplished in principle by discretization 
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and exhaustive search over the parameter space. The 
reason we have not done this is that such a brute force 
approach is too CPU intensive for practical desktop PC 
calculations. 

In a similar vein, we would like to find explicit states 
p and a with 

E P (p AlBl ® a A2B2 ) ^ E P {p AlBl ) + E P (a A2B2 ). 

To end, we remark that our study does not impact on 
the possible non-additivity of Ep° = E L Q q/ which we 
recommend to the reader as an interesting problem in 
itself. Even more interesting however is the problem 
of finding a tractable (or even "single-letter") expres- 
sion for Ep°, which in a certain sense would generalized 
Wyner's beautiful answer for the classical randomness 
cost of probability distributions Pxy HHII1]- 
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